A MONOTONICITY PROPERTY OF VARIANCES 



J. M. ALDAZ 

Abstract. We prove that variances of non- negative random variables have the following 
monotonicity property: For all < r < s < 1, and all < X g L 2 , we have Var(A r ) 1//r < 
Var(A s ) 1/s - We also discuss the real valued case. 



1. Introduction 

Here, statements such as X > or X = Y, are always meant in the almost sure sense. It is 
immediate from either Holder's or Jensen's inequality that for every random variable X > 
and all < r < s < oo, we have (EX r ) 1 ^ r < (EX 3 ) 1 ^. In this note we obtain an analogous 
result for non-negative random variables X 6 L 2 and variances. As in the case of norms, 
this inequality helps to clarify the strength of hypotheses that might be made on Var(X r ). 
An application to a recent refinement of the AM-GM inequality YYi=i X T — SlLi a i x i is 
presented. Lastly, this monotonicity property can be used when dealing with real valued 
random variables, by decomposing them into their positive and negative parts, since the 
variance of X is always comparable to the sum of the variances of X + and X_. 

2. Monotonicity of Var(X s ) 1/s , and the AM-GM inequality. 

Let < X e L 2 , so Var(X) is well defined. Since for all < s < 1 we have ||X|| 2s < H^lhi 
all variances Var(X s ) are also well defined, and thus it is natural to ask how these quantities 
behave as s changes. In order to be able to compare them, we need to have the same 
homogeneity on both sides of the inequality, so we consider Var(A AS ) 1/ ' s , which always is 
homogeneous of order 2: For all t > 0, Var((tX) s ) 1 / s = t 2 Var(X s ) 1 / s . 

Theorem 2.1. Let < X e L 2 and let < r < s < 1. Then 



(1) Var(X r ) 1/r < Var(X s ) 1/s . 

Proof. Observe first that it is enough to prove the case Var(X s ) 1 / s < Var(X) whenever 
< s < 1. The fact that Var(X s ) 1//s is increasing in s then follows immediately by making 
the change of variables Y = X s : Var(X r ) s / r = Yai{Y r / s ) s ' r < Var(F) = Var(X s ). 

Next, we assume that ||X||2 = 1. This can be done by homogeneity, since writing Y = 
X/||X|| 2 , we see that VarfX 5 ) 1 / 5 < Var(X) is equivalent to Var(F s ) 1 / s < Var(Y). Under the 
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condition ||X|| 2 = 1, we always have, for every < s < 1 and every t > 0, ||X||| S < 1, and 
hence, Var(X s )' < 1. 

We shall use the following well known (and direct) interpolation consequence of Holder's 
inequality (cf., for instance, |Fot Proposition 6.10, p. 177]) which is valid for both finite and 
infinite measure spaces: If < r < s < p, and / G U (1L P , then / belongs to all intermediate 
spaces L s , and furthermore, ||/|| s < where t G (0, 1) is defined by the equation 

1/s = (1 — t)/r + t/p. Using the indices < s < 2s < 2, together with ||X|| 2 = 1, yields 
t = 1/(2 -s) and 

(2) E{X 2s ) < (EX s Y 2 - 2s W- s \ 
while the indices < s < 1 < 2 give t = (2 — 2s) /(2 — s) and 

(3) E(X) < (EX s ) 1/{2 - s) . 

Now, by the preceding assumptions on the size of norms and variances (in particular, by 
||X S ||2 = H^llls < 1) together with 1/s > 1, we have 

X s \ , r f X s \ (EX 3 ) 2 



Var(X s ) 1/s < Var(X s ) = \\X S \\ 2 2 Var ( n^L ) < Var ( 



Thus, it suffices to show that 

(EX S ) 2 



X S \\ 2 J E(X 2s ) 
< Var(X) = 1 - (EX) 2 , 



E(X 2s j 
or equivalently, that 

(EX) 2 E(X 2s ) < (EX S ) 2 . 
But this follows from ()3]) and (j5J), since 

(EX) 2 E(X 2s ) < (EX S ) 2 / ^ 2 ~ s ^ (EX S ) ( 2_2s )/ ( 2 ~ s ) = (EX 3 ) 2 . 

□ 

Remark 2.2. The interpolation result noted above is useful in a probability context since, 
instead of the usual bound ||X|| S < whenever < s < p, it yields the stronger inequality 

ll^lls < ll^llr^'ll^llp for each < r < s, with t defined by 1/s = (1 - t)/r + t/p. 

Of course, under different integrability conditions (X G LP instead of X G L 2 ) the analogous 
inequalities hold, by using the change of variables Y = X y l 2 G L 2 . 

Corollary 2.3. Let p>0, let < X G VP, and let < r < s < p/2. Then 



(4) Var(X r ) 1/r < Var(X s ) 1/s . 

Next we apply the preceding result to a recent refinement of the inequality between 
arithmetic and geometric means (the AM-GM inequality) proven in |Alj (the reader in- 
terested in some probabilistic aspects of the AM-GM inequality, may want to consult |A3] 
and the references contained therein; for non- variance bounds, see [A4] and its references). 
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Let us recall the notation used in |A1] : X denotes the vector with non- negative entries 
(xi, . . . , x n ), and X 1 ! 2 = (x^ 2 , . . . , xl/ 2 ). Given a sequence of weights a = (oti, . . . , a n ) with 
«i > and Ym=i a i = 1) an d a vector Y" = (yi, . . . , y n ), its arithmetic mean is denoted by 
E a (Y) := J2i=i a iVii ^s geometric mean, by H a (Y) := niLil/fS an d its variance, by 

Var a (Y) = ^2 a i [ Vi ~ X] ) = a ^ ~ ( afe2/fc 

=1 V k=l J i=l \fc=l 

Finally, Ym ax and respectively stand for the maximum and the minimum values of Y. 

Conceptually, variance bounds for E a X — Tl a X represent the natural extension of the 
equality case in the AM-GM inequality (zero variance is equivalent to equality). From a more 
applied viewpoint, the variance is used in the Economics literature to estimate the difference 
between these means (cf., for instance, [SH Chapter 1, Appendix 2]; both the arithmetic and 
geometric means are used when reporting on the performance of a portfolio). 

The bounds for the difference in the AM-GM appearing in [AlJ involve Var(X 1//2 ), rather 
than cr(X) = Var a (X) 1 / 2 . Using Theorem 12.11 or Corollary 12. 3[ the following upper bound 
follows: E a X — H a X < —^—a(X). More generally, by putting together [Alt Theorem 4.2] 

a min 

with Corollary 12.31 we obtain the next result. 

Theorem 2.4. For n > 2 and i = 1, . . . , n, let X = ( ) be such that Xi > 0, and let 

a = (a±, . . . , a n ) satisfy «j > and XT=i a i = 1- Then for all r G (0, 1] and all s G [1, oo) 
we have 

(5) X&T a (X r ' 2 ) l l r < E a X - U a X < — Vai a (X s / 2 )^ s . 

1 C^min ^min 

These bounds are optimal (cf. \A1\ Examples 2.1 and 2.3]). Theorem 4.2 from |A1] . and 
its proof, were suggested by |CaFit Theorem], which states that if < X min , then 

(6) — ]— Var a (X) < E a X - U a X < — )— Var Q (X). 

A drawback of (jSJ) is that the bounds depend explicitly on X max and X min , something that 
makes it unsuitable for some standard applications, such as, for instance, refining Holder's 
inequality (see |A1] for more details). Of course, since the variance is homogeneous of degree 
2, dividing by X max and X min in (jSJ), gives the left and right hand sides the same homogeneity 
as the middle term. We also point out that the inequality Var Q (X 1 / 2 ) < E a X — n Q X, 
appeared in |A2[ Theorem 1]; this inequality is trivial, useful, and as n — > oo, asymptotically 
optimal, since (1 — a m i n ) _1 — > 1. 

3. Real valued random variables. 

The monotonicity result applies to X > only: If X < with positive probability, then 
X s may fail to be defined as a real valued function, for certain values of s > 0. While trivially 
Var(X) > Var(|X|), in general these two quantities are not comparable, so it is not possible 
to simply replace X with |X|. However, monotonicity can be used on Var(X + ) and Var(X_), 
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where X + := max{X, 0} and X_ := — min{X, 0} denote the positive and negative parts of X, 
respectively. Thus, indirectly it also applies to Var(X), since the latter is indeed comparable 
to Var(X + ) + Var(X_). We have not found this result in the literature, so we include it here 
for completeness. Essentially, the next theorem says that 

Var(X+) + Var(X_) < Var(X) < 2 (Var(X + ) + Var(X_)) , 

and the extremal cases occur, for the left hand side inequality, when either X > or X < 0, 
and for the right hand side inequality, when X = c(1d — 1d c ), where c G R and D is a 
measurable set. 

Theorem 3.1. Let X e L 2 be real valued, and denote by B the sub-a -algebra 

B := {0, n, {X > 0}, {X = 0}, {X < 0}}. 

Then 

(7) Var(X + ) + Var(X_) < Var(X) 

(8) < Var(X+) + Var(X_) + Var(£(X + |£)) + Var(£(X_|£)) < 2 (Var(X + ) + Var(X_)) . 

Furthermore, equality holds in the first inequality if and only if either X > or X < 0; in 
the second, if and only if either X > 0, or X < 0, or < P({X > 0}), < P{{X < 0}), 
= P({X > 0}), and E(X + \{X > 0}) = £(X_|{X < 0}); and m the third, if and only if 
X = E(X\B). 

Proof. The first inequality follows directly from the definitions, the second, from the convexity 
of 4>{x) = x 2 , and the third, from the law of total variance. More precisely, 

Var(X + ) + Var(X_) < Var(X + ) + Var(X_) + 2EX+EX. 

= E{X\) - (EX + ) 2 + E(X 2 ) - (£X_) 2 + 2EX + EX_ = E(X 2 ) - (EX + - EX^) 2 = Var(X), 

and we have equality if and only if EX + EX_ = 0, which happens if and only if either X > 
or X < 0. 

Since, as we just saw, Var(X) = Var(X + ) + Var(X_) + 2EX + EX_, to prove the middle 
inequality in ([7])-([H]), it is enough to show that 

(9) 2£X+£X_ < Var(£(X+|£)) + Var(£(X_|B)). 
Observe that if either X > or X < 0, then 

2EX + EX^ = 0, 

and if additionally either X > or X < 0, then 

= Var(£(X+|£)) + Var(£(X_|23)). 

Next, assume that both A := P{X > 0} > and B := P{X < 0} > 0, and write C := 
P{X = 0}, soO < A + B = 1-C < 1. Then E(X\B) takes exactly two values different from 
0, say E(X\B) = a > on {X > 0}, and E(X\B) = -b < on {X < 0}. With this notation, 
in order to obtain the middle inequality it suffices to show that 

2EX + EX_ = 2AaBb < Var (E(X+\B)) + Var (E(X_\B)) = Aa 2 - (Aa) 2 + Bb 2 - (Bb) 2 , 
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or equivalently, that 

(Aa + Bb) 2 < Aa 2 + Bb 2 . 
But this is follows from the convexity of 4>(x) = x 2 , since 

(Aa + Bb) 2 = (A + B) 2 ( -^—a + -r^—b] " < (A + B) 2 ( — —a 2 + -r^b 2 ) 
v ; v ; \A + B A + B J ~ \A + B A + B J 

= (A + B) (Aa 2 + Bb 2 ) < Aa 2 + Bb 2 . 

Furthermore, (Aa + Bb) 2 = Aa 2 + Bb 2 if and only if both a = b (by the strict convexity of 0) 
and A + B = 1. 

Finally, the law of total variance Var(X) = Var (E(X\B)) + E (Var(X|i3)), applied to both 
X + and X_, tells us that Var(X+) > Var (E(X + \B)) and Var(X_) > Var (E(X_\B)), with 
equality if and only if E (Var(Jf + |0)) = = E (Varpf_|B)), which happens if and only if 
both X + and X_ are constant on {X > 0} and on {X < 0} respectively. This yields the last 
inequality, together with the equality condition X = E(X\B). □ 

Remark 3.2. Instead of B = {0, Q, {X > 0},{X = 0},{X < 0}}, either of the simpler 
algebras B x = {0, Q, {X > 0}, {X < 0}} or B 2 = {0, Q, {X > 0}, {X < 0}} could have been 
used in the preceding theorem, and the inequalities stated there would still hold. But the 
equality conditions would be less symmetric. For instance, if X > 0, then B\ is trivial up 
to sets of measure zero (that is, as a measure algebra), so E(X + \B\) = EX + = EX, and 
Var (E(X + \Bi)) = 0. Thus, the middle inequality in ©-((8]), is actually an equality in this 
case. However, if X = -1 D < 0, where < P(D) < 1, then X = X- = E(X-\Bi), and 
Var(V) < Var(V_) + Var(E(X^\B 1 )) = 2Var(V). 

Corollary 3.3. Let p > 2, let X e LP be real valued, and let < r < 2 < s < p. Then 
Var(X r jY r + Var(X r _ /2 ) 2/r < Var(V) < 2 (Var(xf) 2 / S + Var(X! /2 ) 2 / s ) . 
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